Abstract

A Central Limit Theorem for non-commutative random variables is proved using the Lindeberg method. 
The theorem is a generalization of the Central Limit Theorem for free random variables proved by Voiculescu. 
The Central Limit Theorem in this paper relies on an assumption which is weaker than freeness. 



1 Introduction 

One of the most important results in free probability theory is the Central Limit Theorem (CLT) for free
random variables ([11]). It was proved almost simultaneously with the invention of free probability theory.
Later conditions of the theorem were relaxed ([10]). Moreover, a far-reaching generalization was achieved
in 0], which studied domains of attraction of probability laws with respect to free additive convolutions. 
See also 0. 

Freeness is a very strong condition imposed on operators and it is of interest to find out whether the 
Central Limit Theorem continues to hold if this condition is somewhat relaxed. This problem calls for a 
different proof of the non-commutative CLT which does not depend on R-transforms or on the vanishing of 
mixed free cumulants, because both of these techniques are closely connected with the concept of freeness. 

In this paper we give a proof of free CLT that avoids using either R-transforms or free cumulants. This 
allows us to develop a generalization of the free CLT to random variables that are not necessarily free but that 
satisfy a weaker assumption. An example shows that this assumption is strictly weaker than the assumption 
of freeness. 

The proof that we use is a modification of the Lindeberg proof of the classical CLT ([6]). The main
difference is that we use polynomials instead of arbitrary functions from $C_c^3(\mathbb{R})$, and that more ingenuity is
required to estimate the residual terms in the Taylor expansion formula.

The closest result to the result in this paper is Theorem 2.1 in [12], where the Central Limit Theorem is
proved under the conditions on summands that are weaker than the requirement of freeness. The conditions 
that we use are somewhat different than those in Voiculescu's paper. In addition, we give an explicit example 
of variables that are not free but that satisfy conditions of the theorem. 

The rest of the paper is organized as follows. Section 2 provides background material and formulates 
the main result. Section 3 shows by an example that a condition in the main result is strictly weaker than the 
condition of freeness. Section 4 contains the proof of the main result. And Section 5 concludes. 
2 Background and Main Theorem



Before proceeding further, let us establish the background. A non-commutative random space $(\mathcal{A}, E)$ is a
pair of an operator algebra $\mathcal{A}$ and a linear functional $E$ on $\mathcal{A}$. It is assumed that $\mathcal{A}$ is closed relative to taking
the adjoints and contains a unit, and that $E$ is

1) positive, i.e., $E(X^*X) \geq 0$ for every $X \in \mathcal{A}$,

2) finite, i.e., $E(I) = 1$ where $I$ denotes the unit operator, and

3) tracial, i.e., $E(X_1 X_2) = E(X_2 X_1)$ for every $X_1$ and $X_2 \in \mathcal{A}$.

This linear functional is called expectation. Elements of A are called random variables. 
Let X be a self-adjoint random variable (i.e., a self-adjoint operator from algebra A). We can write X as 
an integral over a resolution of identity: 



\[ X = \int_{-\infty}^{\infty} \lambda \, dP_X(\lambda), \]

where $P_X(\lambda)$ is an increasing family of commuting projectors. Then we can define the spectral probability
measure of interval $(a, b]$ as follows:

\[ \mu_X\{(a, b]\} = E[P_X(b) - P_X(a)]. \]

We can extend this measure to all measurable subsets in the usual way. We will call $\mu_X$ the spectral probability
measure of random variable $X$, or simply its spectral measure.

We can calculate the expectation of any summable function of a self-adjoint variable X by using its 
spectral measure: 

\[ E f(X) = \int_{-\infty}^{\infty} f(\lambda) \, d\mu_X(\lambda). \]

In particular, the moments of the probability measure $\mu_X$ equal the expectation values of the powers of $X$:

\[ \int_{-\infty}^{\infty} \lambda^k \, d\mu_X(\lambda) = E(X^k). \]
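The moment identity can be illustrated in the simplest finite-dimensional case. The sketch below is only an illustration under assumptions that are not in the paper: the algebra is taken to be the real 2x2 matrices, the expectation is the normalized trace, and the helper names are hypothetical. In that setting the spectral measure of a symmetric matrix puts mass 1/2 on each eigenvalue, so the two ways of computing a moment should agree.

```python
import math

def matmul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def expectation(a):
    """Normalized trace tr(A)/2: positive, finite (E(I) = 1) and tracial."""
    return (a[0][0] + a[1][1]) / 2

def eigenvalues(a):
    """Closed-form eigenvalues of a real symmetric 2x2 matrix."""
    mean = (a[0][0] + a[1][1]) / 2
    radius = math.hypot((a[0][0] - a[1][1]) / 2, a[0][1])
    return [mean - radius, mean + radius]

def moment_from_trace(a, k):
    """E(X^k) computed directly from the matrix power."""
    power = [[1.0, 0.0], [0.0, 1.0]]
    for _ in range(k):
        power = matmul(power, a)
    return expectation(power)

def moment_from_spectrum(a, k):
    """k-th moment of the spectral measure (mass 1/2 at each eigenvalue)."""
    return sum(lam ** k for lam in eigenvalues(a)) / 2

# A self-adjoint "random variable" in this toy algebra.
X = [[1.0, 2.0], [2.0, -1.0]]

for k in range(1, 7):
    print(k, moment_from_trace(X, k), moment_from_spectrum(X, k))
```

For this particular matrix the odd moments vanish because the eigenvalues are symmetric about zero.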

Let us now recall the definition of freeness. Consider sub-algebras $\mathcal{A}_1, \ldots, \mathcal{A}_n$. Let $a_i$ denote elements
of these sub-algebras and let $k(i)$ be a function that maps the index of an element to the index of the
corresponding algebra: $a_i \in \mathcal{A}_{k(i)}$.

Definition 1 The algebras $\mathcal{A}_1, \ldots, \mathcal{A}_n$ (and their elements) are free if $E(a_1 \cdots a_m) = 0$ whenever the following
two conditions hold:

(a) $E(a_i) = 0$ for every $i$, and

(b) $k(i) \neq k(i+1)$ for every $i < m$.

The variables $X_1, \ldots, X_n$ are called free if the algebras $\mathcal{A}_i$ generated by $\{I, X_i, X_i^*\}$, respectively, are free.

An important property of freeness is that we can compute the moments of the products of the free random 
variables. 

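Proposition 2 Suppose $X_1, \ldots, X_n$ are free. Then

\[ E(X_1 \cdots X_n) = \sum_{r=1}^{n} \; \sum_{1 \leq k_1 < \ldots < k_r \leq n} (-1)^{r-1} E(X_{k_1}) \cdots E(X_{k_r}) \, E(X_1 \cdots \widehat{X}_{k_1} \cdots \widehat{X}_{k_r} \cdots X_n), \qquad (1) \]

where $\widehat{\ }$ denotes terms that are omitted.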
This property is easy to prove by induction. However, we will not need all the power of this property.
Below we formulate the conditions that we need to impose on the random variables to prove the CLT. These 
conditions are consequences of freeness but are likely to be weaker. 

We will say that a sequence of zero-mean random variables $X_1, \ldots, X_n, \ldots$ satisfies Condition A if:

1. For every $k$, $E(X_k X_{i_1} \cdots X_{i_r}) = 0$ provided that $i_s \neq k$ for $s = 1, \ldots, r$.

2. For every $k \geq 2$, $E(X_k^2 X_{i_1} \cdots X_{i_r}) = E(X_k^2) \, E(X_{i_1} \cdots X_{i_r})$ provided that $i_s < k$ for $s = 1, \ldots, r$.

3. For every $k \geq 2$,

\[ E(X_k X_{i_1} \cdots X_{i_p} X_k X_{i_{p+1}} \cdots X_{i_r}) = E(X_k^2) \, E(X_{i_1} \cdots X_{i_p}) \, E(X_{i_{p+1}} \cdots X_{i_r}) \]
provided that $i_s < k$ for $s = 1, \ldots, r$.

Intuitively, if we know how to calculate every moment of the sequence $X_1, \ldots, X_{k-1}$, then using Condition A
we can also calculate the expectation of any product of random variables $X_1, \ldots, X_k$ that involves no
more than two occurrences of variable $X_k$. Part 1 of Condition A is stronger than is needed for this calculation,
since it involves variables with indices higher than $k$. However, we will need this additional strength in
the proof of Lemma 13 below, which is essential for the proof of the main result.

Proposition 3 Every sequence of free random variables $X_1, \ldots, X_n, \ldots$ satisfies Condition A.

This proposition can be checked by direct calculation using Proposition 2.
We will also need the following fact.

Proposition 4 Let $X_1, \ldots, X_l$ be zero-mean variables that satisfy Condition A(1), and let $Y_{l+1}, \ldots, Y_n$ be
zero-mean variables which are free from each other and from the algebra generated by variables $X_1, \ldots, X_l$.
Then the sequence $X_1, \ldots, X_l, Y_{l+1}, \ldots, Y_n$ satisfies Condition A(1).

Proof: Consider the moment $E(X_k A_{i_1} \cdots A_{i_s})$, where $A_{i_t}$ is either one of $Y_j$ or one of $X_i$ but it cannot
equal $X_k$. Then we can use the fact that $Y_j$ are free and write

\[ E(X_k A_{i_1} \cdots A_{i_s}) = \sum_{\alpha} c_\alpha \, E(X_k X_{\tau_1(\alpha)} \cdots X_{\tau_p(\alpha)}), \]

where none of $X_{\tau_t(\alpha)}$ equals $X_k$. Then, using the assumption that $X_i$ satisfy Condition A(1), we conclude
that $E(X_k A_{i_1} \cdots A_{i_s}) = 0$. Also, $E(Y_k A_{i_1} \cdots A_{i_s}) = E(Y_k) \, E(A_{i_1} \cdots A_{i_s}) = 0$, provided that none of $A_{i_t}$
equals $Y_k$. In sum, the sequence $X_1, \ldots, X_l, Y_{l+1}, \ldots, Y_n$ satisfies Condition A(1). QED.

While the freeness of random variables $X_i$ is the same concept as the freeness of the algebras that they
generate, Condition A deals only with variables $X_i$, and not with the algebras that they generate. For example,
it is conceivable that a sequence $\{X_i\}$ satisfies Condition A but $\{X_i^2 - E(X_i^2)\}$ does not. In particular,
this implies that Condition A requires checking a much smaller set of moment conditions than freeness.
Below we will present an example of random variables which are not free but which satisfy Condition A.

Recall that the standard semicircle law $\mu_{sc}$ is the probability distribution on $\mathbb{R}$ with the density $(2\pi)^{-1} \sqrt{4 - x^2}$
if $x \in [-2, 2]$, and $0$ otherwise. It has zero mean and unit variance. We are going to prove the following Theorem.
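As a quick numerical sanity check of the normalization of the semicircle density, the sketch below integrates it with a plain midpoint rule. The function names and the step count are illustrative choices, not part of the paper. It also compares the even moments with the Catalan numbers, which is a standard fact about the unit-variance semicircle law rather than something derived in this text.

```python
import math

def semicircle_density(x):
    """Density of the standard semicircle law, supported on [-2, 2]."""
    return math.sqrt(max(4.0 - x * x, 0.0)) / (2.0 * math.pi)

def semicircle_moment(k, steps=100_000):
    """Midpoint-rule approximation of the k-th moment of the semicircle law."""
    h = 4.0 / steps
    total = 0.0
    for i in range(steps):
        x = -2.0 + (i + 0.5) * h
        total += x ** k * semicircle_density(x)
    return total * h

def catalan(n):
    """n-th Catalan number: 1, 1, 2, 5, 14, ..."""
    return math.comb(2 * n, n) // (n + 1)

for n in range(4):
    # Even moments should be close to Catalan numbers; odd moments vanish.
    print(2 * n, round(semicircle_moment(2 * n), 4), catalan(n))
```

The zeroth moment being 1 confirms the factor $(2\pi)^{-1}$, and the second moment being 1 confirms the unit variance used in the theorem.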
Theorem 5 Suppose that

(i) $\{\xi_i\}$ is a sequence of self-adjoint random variables that satisfies Condition A;

(ii) every $\xi_i$ has absolute moments of all orders, which are uniformly bounded, i.e., $E|\xi_i|^k \leq \mu_k$ for all $i$;

(iii) $E\xi_i = 0$, $E\xi_i^2 = \sigma_i^2$;

(iv) $(\sigma_1^2 + \ldots + \sigma_N^2)/N \to s$ as $N \to \infty$.

Then the spectral measure of $S_N = (\xi_1 + \ldots + \xi_N) / \sqrt{\sigma_1^2 + \ldots + \sigma_N^2}$ converges in distribution to the
semicircle law $\mu_{sc}$.

The contribution of this theorem is twofold. First, it shows that the semicircle central limit holds for a 
certain class of non-free variables. Second, it gives a proof of the free CLT which is different from the usual 
proof through R-transforms. However, it is not stronger than a version of the free CLT which is formulated 
in Section 2.5 in [10].



3 Example 

Let us present an example which shows that Condition A is strictly weaker than the freeness condition.

Let $F$ be the free group with a countable number of generators $f_k$. Consider the set of relations $R =
\{f_k f_{k-1} f_k f_{k-1} f_k f_{k-1} = e\}$, where $k \geq 2$, and define $G = F/\mathcal{R}$, that is, $G$ is the group with generators $f_k$
and relations generated by relations in $R$.

Here are some consequences of these relations:

1) $f_{k-1} f_k f_{k-1} f_k f_{k-1} f_k = e$.

(Indeed, $e = f_k^{-1} (f_k f_{k-1} f_k f_{k-1} f_k f_{k-1}) f_k = f_{k-1} f_k f_{k-1} f_k f_{k-1} f_k$.)

2) $f_k^{-1} f_{k-1}^{-1} f_k^{-1} f_{k-1}^{-1} f_k^{-1} f_{k-1}^{-1} = e$ and $f_{k-1}^{-1} f_k^{-1} f_{k-1}^{-1} f_k^{-1} f_{k-1}^{-1} f_k^{-1} = e$.

We are interested in the structure of the group $G$. For this purpose we will study the structure of $\mathcal{R}$,
which is a subgroup of $F$ generated by elements of $R$ and their conjugates. We will represent elements of
$F$ by words, that is, by sequences of generators. We will say that a word is reduced if it does not have a
subsequence of the form $f_k f_k^{-1}$ or $f_k^{-1} f_k$. It is cyclically reduced if it does not have the form of $f_k \ldots f_k^{-1}$ or
$f_k^{-1} \ldots f_k$. We will call the number of elements in a reduced word $w$ its length and denote it as $|w|$. A set of
relations $R$ is symmetrized if for every word $r \in R$, the set $R$ also contains its inverse $r^{-1}$ and all cyclically
reduced conjugates of both $r$ and $r^{-1}$.
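The notions of reduced and cyclically reduced words are easy to make concrete. In the sketch below, a word is encoded as a list of pairs `(k, e)` standing for $f_k^{e}$ with $e = \pm 1$; this encoding and the helper names are assumptions made for illustration only.

```python
def reduce_word(word):
    """Freely reduce a word by cancelling adjacent f_k f_k^{-1} and f_k^{-1} f_k."""
    stack = []
    for gen, exp in word:
        if stack and stack[-1] == (gen, -exp):
            stack.pop()          # adjacent inverse pair cancels
        else:
            stack.append((gen, exp))
    return stack

def inverse(word):
    """Inverse word: reverse the order and flip every exponent."""
    return [(gen, -exp) for gen, exp in reversed(word)]

def is_cyclically_reduced(word):
    """Reduced, and the first and last letters are not mutually inverse."""
    word = list(word)
    if reduce_word(word) != word:
        return False
    return len(word) < 2 or word[0] != (word[-1][0], -word[-1][1])

def relator(k):
    """The defining relator f_k f_{k-1} f_k f_{k-1} f_k f_{k-1} for k >= 2."""
    return [(k, 1), (k - 1, 1)] * 3

r = relator(2)
# Conjugating by f_2 and reducing gives the cyclic shift f_1 f_2 f_1 f_2 f_1 f_2.
shifted = reduce_word([(2, -1)] + r + [(2, 1)])
print(shifted)
```

The conjugate computed at the end is exactly consequence 1) of the relations, which is why it belongs to the symmetrized set.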

For our particular example, a symmetrized set of relations is given by the following list:

\[ \left\{ \begin{array}{ll} f_k f_{k-1} f_k f_{k-1} f_k f_{k-1}, & f_{k-1} f_k f_{k-1} f_k f_{k-1} f_k, \\ f_{k-1}^{-1} f_k^{-1} f_{k-1}^{-1} f_k^{-1} f_{k-1}^{-1} f_k^{-1}, & f_k^{-1} f_{k-1}^{-1} f_k^{-1} f_{k-1}^{-1} f_k^{-1} f_{k-1}^{-1} \end{array} \right\}, \]

where $k$ are all integers $\geq 2$.

A word $b$ is called a piece (relative to a symmetrized set $R$) if there exist two elements of $R$, $r_1$ and $r_2$,
such that $r_1 = bc_1$ and $r_2 = bc_2$. In our case, each $f_k$ and $f_k^{-1}$ with index $k \geq 2$ is a piece because $f_k$
is the initial part of relations $f_k f_{k-1} f_k f_{k-1} f_k f_{k-1}$ and $f_k f_{k+1} f_k f_{k+1} f_k f_{k+1}$, and $f_k^{-1}$ is the initial part of
relations $f_k^{-1} f_{k-1}^{-1} f_k^{-1} f_{k-1}^{-1} f_k^{-1} f_{k-1}^{-1}$ and $f_k^{-1} f_{k+1}^{-1} f_k^{-1} f_{k+1}^{-1} f_k^{-1} f_{k+1}^{-1}$. There are no other pieces.

Now we introduce the condition of small cancellation for a symmetrized set $R$:

Condition 6 ($C'(\lambda)$) If $r \in R$ and $r = bc$ where $b$ is a piece, then $|b| < \lambda |r|$.

Essentially, the condition says that if two relations are multiplied together, then a possible cancellation
must be relatively small. Note that if $R$ satisfies $C'(\lambda)$ then it satisfies $C'(\mu)$ for all $\mu \geq \lambda$.
In our example $R$ satisfies $C'(1/5)$.
Another important condition is the triangle condition. 

Condition 7 (T) Let $r_1$, $r_2$, and $r_3$ be three arbitrary elements of $R$ such that $r_2 \neq r_1^{-1}$ and $r_3 \neq r_2^{-1}$. Then
at least one of the products $r_1 r_2$, $r_2 r_3$, or $r_3 r_1$ is reduced without cancellation.

In our example, Condition (T) is satisfied. 

If $s$ is a word in $F$, then $s > \lambda R$ means that there exists a word $r \in R$ such that $r = st$ and $|s| > \lambda |r|$. An
important result from small cancellation theory that we will use later is the following theorem:

Theorem 8 (Greendlinger's Lemma) Let $R$ satisfy $C'(1/4)$ and $T$. Let $w$ be a non-trivial, cyclically reduced
word with $w \in \mathcal{R}$. Then either

(1) $w \in R$,

or some cyclically reduced conjugate $w^*$ of $w$ contains one of the following:

(2) two disjoint subwords, each $> \frac{3}{4} R$, or

(4) four disjoint subwords, each $> \frac{1}{2} R$.

This theorem is Theorem 4.6 on p. 251 in Q. 

Since in our example $R$ satisfies both $C'(1/4)$ and $T$, we can infer that in our case the conclusion of the
theorem must hold. For example, (2) means that we can find two disjoint subwords of $w$, $s_1$ and $s_2$, and two
elements of $R$, $r_1$ and $r_2$, such that $r_i = s_i t_i$ and $|s_i| > (3/4) |r_i| = 9/2$. In particular, we can conclude that
in this case $|w| \geq 10$. Similarly, in case (4), $|w| \geq 16$. One immediate application is that $G$ does not collapse
into the trivial group. Indeed, the generators $f_i$ are not equal to the identity $e$.

Let $L^2(G)$ be the functions on $G$ that are square-summable with respect to the counting measure. $G$ acts
on $L^2(G)$ by left translations:

\[ (L_g x)(h) = x(gh). \]

Let $\mathcal{A}$ be the group algebra of $G$. The action of $G$ on $L^2(G)$ can be extended to the action of $\mathcal{A}$ on $L^2(G)$.
Define the expectation on this group algebra by the following rule:

\[ E(h) = \langle \delta_e, L_h \delta_e \rangle, \]

where $\langle \cdot, \cdot \rangle$ denotes the scalar product in $L^2(G)$. Alternatively, the expectation can be written as follows:

\[ E(h) = a_e, \]

where $h = \sum_{g \in G} a_g g$ is a representation of a group algebra element $h$ as a linear combination of elements
$g \in G$. The expectation is clearly positive and finite by definition. It is also tracial because $g_1 g_2 = e$ if and
only if $g_2 g_1 = e$.

If $L_h = \sum_{g \in G} a_g L_g$ is a linear operator corresponding to the element of group algebra $h = \sum_{g \in G} a_g g$,
then its adjoint is $(L_h)^* = \sum_{g \in G} \overline{a}_g L_{g^{-1}}$, which corresponds to the element $h^* = \sum_{g \in G} \overline{a}_g g^{-1}$.

Consider elements $X_i = f_i + f_i^{-1}$. They are self-adjoint and $E(X_i) = 0$. Also we can compute $E(X_i^2) = 2$.
Indeed it is enough to note that $f_i^2 \neq e$ and $f_i^{-2} \neq e$, and this holds because insertion or deletion of an
element from $R$ changes the degree of $f_i$ by a multiple of 3. Therefore, every word equal to the identity must have
the degree of every $f_i$ equal to $0$ modulo 3.
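The degree argument can be sketched in a few lines. The code below is an illustration under assumptions: words are encoded as lists of pairs `(k, e)` for $f_k^{e}$, and the helper names are hypothetical. It implements only the necessary condition stated above (every exponent sum is $0$ modulo 3); passing the test does not prove that a word equals the identity.

```python
from collections import Counter
from itertools import product

def exponent_sums(word):
    """Total exponent (degree) of each generator in a word."""
    sums = Counter()
    for gen, exp in word:
        sums[gen] += exp
    return sums

def passes_degree_test(word):
    """Necessary (not sufficient) condition for a word to equal e in G."""
    return all(total % 3 == 0 for total in exponent_sums(word).values())

relator = [(2, 1), (1, 1)] * 3                  # f_2 f_1 f_2 f_1 f_2 f_1
print(passes_degree_test(relator))              # relators pass
print(passes_degree_test([(1, 1), (1, 1)]))     # f_1^2 fails, so f_1^2 != e
print(passes_degree_test([(1, -1), (1, -1)]))   # f_1^{-2} fails as well

# Expand X_2 X_1 X_2 X_1 X_2 X_1 with X_i = f_i + f_i^{-1}: one word per sign choice.
generators = [2, 1, 2, 1, 2, 1]
candidates = [
    list(zip(generators, signs))
    for signs in product((1, -1), repeat=6)
    if passes_degree_test(list(zip(generators, signs)))
]
print(len(candidates))
```

Only a handful of the 64 words in the expansion survive the degree test; deciding which of those actually equal the identity requires a finer tool such as Greendlinger's lemma.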

Proposition 9 The sequence of variables {Xi} is not free but satisfies Condition A. 
Proof: The variables $X_k$ are not free. Consider $X_2 X_1 X_2 X_1 X_2 X_1$. Its expectation is 2, because $f_2 f_1 f_2 f_1 f_2 f_1 = e$
and $f_2^{-1} f_1^{-1} f_2^{-1} f_1^{-1} f_2^{-1} f_1^{-1} = e$, and all other terms in the expansion of $X_2 X_1 X_2 X_1 X_2 X_1$ are different from
$e$. Indeed, the only terms that are not of the form above but still have the degree of all $f_i$ equal to zero modulo
3 are $f_2 f_1^{-1} f_2 f_1^{-1} f_2 f_1^{-1}$ and $f_2^{-1} f_1 f_2^{-1} f_1 f_2^{-1} f_1$, but they do not equal the identity by application of Greendlinger's
lemma. Therefore, $E(X_2 X_1 X_2 X_1 X_2 X_1) = 2$. This contradicts the definition of freeness of variables $X_2$ and
$X_1$.

Let us check Condition A. For A(1), we have to prove that $E(X_k X_{i_1} \cdots X_{i_n}) = 0$, where $k \neq i_s$ and $i_s \neq i_{s+1}$
for every $s$. Consider $E(f_k f_{i_1} \cdots f_{i_n})$, where $k \neq i_s$ and $i_s \neq i_{s+1}$ for every $s$. Note that $f_k f_{i_1} \cdots f_{i_n} \neq e$, as
can be seen from the fact that the degree of $f_k$ does not equal zero modulo 3. Therefore $E(f_k f_{i_1} \cdots f_{i_n}) = 0$.
A similar argument works for $E(f_k^{-1} f_{i_1} \cdots f_{i_n}) = 0$ and more generally for the expectation of every element
of the form $f_k^{\varepsilon} f_{i_1}^{n_1} \cdots f_{i_s}^{n_s}$, where $\varepsilon = \pm 1$ and $n_s$ are integers.

Similarly, we can prove that $E(f_k^{\pm 2} f_{i_1}^{n_1} \cdots f_{i_s}^{n_s}) = 0$ and this suffices to prove A(2).

For A(3) we have to consider elements of the form $f_k^{\varepsilon_1} f_{i_1} \cdots f_{i_p} f_k^{\varepsilon_2} f_{i_{p+1}} \cdots f_{i_q}$. Assume that neither
$f_{i_1} \cdots f_{i_p}$ nor $f_{i_{p+1}} \cdots f_{i_q}$ can be reduced to $e$. Otherwise we can use property A(2). Then the claim is that
$E(f_k^{\varepsilon_1} f_{i_1} \cdots f_{i_p} f_k^{\varepsilon_2} f_{i_{p+1}} \cdots f_{i_q}) = 0$. This is clear when $\varepsilon_1$ and $\varepsilon_2$ have the same sign since in this case the
degree of $f_k$ does not equal $0$ modulo 3. A more difficult case is when $\varepsilon_1 = 1$ and $\varepsilon_2 = -1$. (The case
with opposite signs is similar.) However, in this case we can conclude that $f_k f_{i_1} \cdots f_{i_p} f_k^{-1} f_{i_{p+1}} \cdots f_{i_q} \neq e$
by an application of Greendlinger's lemma. Indeed, the only subwords that this word can contain and which
would also be subwords of an element of $R$, are subwords of length 1 and 2. But these subwords fail
to satisfy the requirement of either (2) or (4) in Greendlinger's lemma. Therefore, we can conclude that
$f_k f_{i_1} \cdots f_{i_p} f_k^{-1} f_{i_{p+1}} \cdots f_{i_q} \neq e$, and therefore A(3) is also satisfied. Thus Condition A is satisfied by random
variables $X_1, \ldots, X_k, \ldots$ in algebra $\mathcal{A}$, although these variables are not free. QED.



4 Proof of the Main Result 

Outline of Proof: Our proof of the free CLT proceeds along the familiar lines of the Lindeberg method.
We take a family of functions, $\{f\}$, and compare $Ef(S_N)$ with $Ef(\widetilde{S}_N)$, where $S_N = X_1 + \ldots + X_N$ and
$\widetilde{S}_N = Y_1 + \ldots + Y_N$, and $Y_i$ are free semicircle variables chosen in such a way that $\mathrm{Var}(S_N) = \mathrm{Var}(\widetilde{S}_N)$.
To estimate $|Ef(S_N) - Ef(\widetilde{S}_N)|$, we substitute the elements in $S_N$ with free semicircle variables, one by
one, and estimate the corresponding change in the expected value of $f(S_N)$. After that, we show that the
total change, as all elements in the sum are substituted with semicircle random variables, is asymptotically
small as $N \to \infty$. Finally, the tightness of the selected family of functions allows us to conclude that the
distribution of $S_N$ must converge to the semicircle law as $N \to \infty$.

The usual choice of functions $f$ in the classical case are functions from $C_c^3(\mathbb{R})$, that is, functions with
a continuous third derivative and compact support. In the non-commutative setting this family of functions
is not appropriate because the usual Taylor series formula is difficult to apply. Intuitively, it is difficult to
develop $f(X + h)$ in a power series of $h$ if variables $X$ and $h$ do not commute. Since the Taylor formula is
crucial for estimating the change in $Ef(S_N)$, we will still use it but we will restrict the family of functions
to polynomials.

To show that the family of polynomials is sufficiently rich for our purposes, we use the following Proposition:

Proposition 10 Suppose there is a unique distribution function $F$ with the moments $\{m^{(r)}, r \geq 1\}$. Suppose
that $\{F_N\}$ is a sequence of distribution functions, each of which has all its moments finite:

\[ m_N^{(r)} = \int_{-\infty}^{\infty} x^r \, dF_N. \]

Finally, suppose that for every $r \geq 1$:

\[ \lim_{N \to \infty} m_N^{(r)} = m^{(r)}. \]

Then $F_N \to F$ vaguely.

See Theorem 4.5.5 on page 99 in [3] for a proof. Note that Chung uses the words "vague convergence" to
denote the kind of convergence which is more often called the weak convergence of probability measures.

Since the semicircle distribution is bounded and therefore is determined by its moments (see Corollary
to Theorem II.12.7 in [8]), the assumption of Proposition 10 is satisfied, and we only need to show
that the moments of $S_N$ converge to the corresponding moments of the semicircle distribution.

Proof of Theorem 5: Define $\eta_i$ as a sequence of random variables that are freely independent among
themselves and also freely independent from all $\xi_i$. Suppose also that $\eta_i$ have semicircle distributions with
$E\eta_i = 0$ and $E\eta_i^2 = \sigma_i^2$. We are going to accept the fact that the sum of free semicircle random variables is
semicircle, and therefore, the spectral distribution of $(\eta_1 + \ldots + \eta_N) / \sqrt{sN}$ converges in distribution to
the semicircle law $\mu_{sc}$ with zero expectation and unit variance. Let us define $X_i = \xi_i / s_N$ and $Y_i = \eta_i / s_N$, where $s_N^2 = \sigma_1^2 + \ldots + \sigma_N^2$.
We will proceed by proving that moments of $X_1 + \ldots + X_N$ converge to moments of $Y_1 + \ldots + Y_N$ and
applying Proposition 10. Let

\[ \Delta f = Ef(X_1 + \ldots + X_N) - Ef(Y_1 + \ldots + Y_N), \]

where $f(x) = x^m$. We want to show that this difference approaches zero as $N$ grows.
By assumption, $EY_i = EX_i = 0$ and $EY_i^2 = EX_i^2 = \sigma_i^2 / s_N^2$.
The first step is to write the difference $\Delta f$ as follows:

\[ \begin{aligned} \Delta f = {} & [Ef(X_1 + \ldots + X_{N-1} + X_N) - Ef(X_1 + \ldots + X_{N-1} + Y_N)] \\ & + [Ef(X_1 + \ldots + X_{N-1} + Y_N) - Ef(X_1 + \ldots + Y_{N-1} + Y_N)] \\ & + \ldots \\ & + [Ef(X_1 + Y_2 + \ldots + Y_{N-1} + Y_N) - Ef(Y_1 + Y_2 + \ldots + Y_{N-1} + Y_N)]. \end{aligned} \]

We intend to estimate every difference in this sum. Let

\[ Z_k = X_1 + \ldots + X_{k-1} + Y_{k+1} + \ldots + Y_N. \qquad (2) \]

We are interested in

\[ Ef(Z_k + X_k) - Ef(Z_k + Y_k). \]
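The one-by-one replacement is a purely algebraic telescoping identity, which can be checked on ordinary numbers. The sketch below is a simplification under explicit assumptions: commuting integers stand in for the operators, the expectation is dropped, and the function names are hypothetical.

```python
def hybrid_sum(x, y, k):
    """Z_k = x_1 + ... + x_{k-1} + y_{k+1} + ... + y_N, with 1-based k."""
    return sum(x[:k - 1]) + sum(y[k:])

def telescoping_terms(f, x, y):
    """The N differences f(Z_k + x_k) - f(Z_k + y_k), k = 1, ..., N."""
    terms = []
    for k in range(1, len(x) + 1):
        z_k = hybrid_sum(x, y, k)
        terms.append(f(z_k + x[k - 1]) - f(z_k + y[k - 1]))
    return terms

def f(t):
    """A monomial test function, f(t) = t^5."""
    return t ** 5

x = [3, -1, 4, 1, -5]
y = [2, 7, -1, 8, 2]

terms = telescoping_terms(f, x, y)
print(sum(terms), f(sum(x)) - f(sum(y)))
```

The identity holds because $Z_k + y_k = Z_{k-1} + x_{k-1}$, so consecutive terms cancel; the substance of the proof is bounding each of the $N$ terms.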

We are going to apply the Taylor expansion formula but first we define directional derivatives. Let
$f'_{X_k}(Z_k)$ be the derivative of $f$ at $Z_k$ in direction $X_k$, defined as follows:

\[ f'_{X_k}(Z_k) =: \lim_{t \downarrow 0} \frac{f(Z_k + tX_k) - f(Z_k)}{t}. \]

The higher order directional derivatives can be defined recursively. For example,

\[ f''_{X_k}(Z_k) =: (f'_{X_k})'_{X_k}(Z_k) = \lim_{t \downarrow 0} \frac{f'_{X_k}(Z_k + tX_k) - f'_{X_k}(Z_k)}{t}. \qquad (3) \]

For polynomials, this definition is equivalent to the following definition:

\[ f''_{X_k}(Z_k) = 2 \lim_{t \downarrow 0} \frac{f(Z_k + tX_k) - f(Z_k) - t f'_{X_k}(Z_k)}{t^2}. \qquad (4) \]
Example 11 Operator directional derivatives of $f(x) = x^4$

Let us compute $f'_X(Z)$ and $f''_X(Z)$ for $f(x) = x^4$. Using definitions we get

\[ f'_X(Z) = Z^3 X + Z^2 X Z + Z X Z^2 + X Z^3 \]

and

\[ f''_X(Z) = 2 (Z^2 X^2 + Z X Z X + X Z^2 X + Z X^2 Z + X Z X Z + X^2 Z^2), \qquad (5) \]
and the expression for $f''_X(Z)$ does not depend on whether definition (3) or (4) was applied.
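The formulas of Example 11 can be checked on small non-commuting matrices. The sketch below is an illustration, not the paper's method: it assumes 2x2 integer matrices for $Z$ and $X$ (the particular entries are arbitrary), and it replaces the limit by five-point finite-difference stencils with step 1, which happen to be exact for polynomials in $t$ of degree at most 4.

```python
from fractions import Fraction

def mul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def word(*factors):
    """Ordered product of matrices; the order matters since they do not commute."""
    result = [[1, 0], [0, 1]]
    for m in factors:
        result = mul(result, m)
    return result

def combo(terms):
    """Linear combination sum(c * M) for a list of (c, M) pairs."""
    return [[sum(c * m[i][j] for c, m in terms) for j in range(2)]
            for i in range(2)]

Z = [[1, 2], [0, 1]]
X = [[0, 1], [1, 3]]

def p(t):
    """f(Z + tX) for f(x) = x^4."""
    shifted = combo([(1, Z), (t, X)])
    return word(shifted, shifted, shifted, shifted)

w = Fraction(1, 12)
# Stencils for the first and second derivative in t at t = 0.
first_fd = combo([(w, p(-2)), (-8 * w, p(-1)), (8 * w, p(1)), (-w, p(2))])
second_fd = combo([(-w, p(-2)), (16 * w, p(-1)), (-30 * w, p(0)),
                   (16 * w, p(1)), (-w, p(2))])

first_formula = combo([(1, word(Z, Z, Z, X)), (1, word(Z, Z, X, Z)),
                       (1, word(Z, X, Z, Z)), (1, word(X, Z, Z, Z))])
second_formula = combo([(2, word(Z, Z, X, X)), (2, word(Z, X, Z, X)),
                        (2, word(X, Z, Z, X)), (2, word(Z, X, X, Z)),
                        (2, word(X, Z, X, Z)), (2, word(X, X, Z, Z))])
commutative_guess = combo([(4, word(Z, Z, Z, X))])  # what 4 z^3 x would suggest

print(first_fd == first_formula, second_fd == second_formula)
print(first_formula, commutative_guess)
```

The last line shows that the scalar rule $4z^3x$ gives a different matrix, which is the reason the derivative has to be written as a sum over all positions of $X$.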

The derivatives of $f$ at $Z_k + \tau X_k$ in direction $X_k$ are defined similarly, for example:

\[ f'''_{X_k}(Z_k + \tau X_k) = 6 \lim_{t \downarrow 0} \frac{f(Z_k + (\tau + t) X_k) - f(Z_k + \tau X_k) - t f'_{X_k}(Z_k + \tau X_k) - \frac{1}{2} t^2 f''_{X_k}(Z_k + \tau X_k)}{t^3}. \]

Next, let us write the Taylor formula for $f(Z_k + X_k)$:

\[ f(Z_k + X_k) = f(Z_k) + f'_{X_k}(Z_k) + \frac{1}{2} f''_{X_k}(Z_k) + \frac{1}{2} \int_0^1 (1 - \tau)^2 f'''_{X_k}(Z_k + \tau X_k) \, d\tau. \qquad (6) \]

Formula (6) can be obtained by integration by parts from the expression

\[ f(Z_k + X_k) - f(Z_k) = \int_0^1 f'_{X_k}(Z_k + \tau X_k) \, d\tau. \]

For polynomials it is easy to write the explicit expressions for $f^{(k)}_{X_k}(Z_k)$ or $f^{(k)}_{X_k}(Z_k + \tau X_k)$ although they
can be quite cumbersome for polynomials of high degree. Very schematically, for a function $f(x) = x^m$,
we can write

\[ f'_{X_k}(Z_k) = X_k Z_k^{m-1} + Z_k X_k Z_k^{m-2} + \ldots + Z_k^{m-1} X_k, \qquad (7) \]

and

\[ f''_{X_k}(Z_k) = 2 (X_k^2 Z_k^{m-2} + X_k Z_k X_k Z_k^{m-3} + \ldots + Z_k^{m-2} X_k^2). \qquad (8) \]

Similar formulas hold for $f'_{Y_k}(Z_k)$ and $f''_{Y_k}(Z_k)$, with the change that $Y_k$ should be used instead of $X_k$.

Using the assumptions that sequence $\{X_k\}$ satisfies Condition A and that variables $Y_k$ are free, we can
conclude that $Ef'_{Y_k}(Z_k) = Ef'_{X_k}(Z_k) = 0$ and that $Ef''_{Y_k}(Z_k) = Ef''_{X_k}(Z_k)$. Indeed, consider, for example,
(8). We can use expression (2) for $Z_k$ and the free independence of $Y_i$ to expand $Ef''_{X_k}(Z_k)$ as

\[ Ef''_{X_k}(Z_k) = \sum_{\alpha} c_\alpha P_\alpha \left( E(X_k \overline{X}_1 X_k \overline{X}_2), E(X_k \overline{X}_3 X_k \overline{X}_4), \ldots \right), \qquad (9) \]

where $\overline{X}_i$ denotes certain monomials in variables $X_1, \ldots, X_{k-1}$ (i.e., $\overline{X}_i = X_{i_1} \cdots X_{i_p}$ with $i_1, \ldots, i_p \in \{1, \ldots, k-1\}$),
and where $\alpha$ indexes certain polynomials $P_\alpha$. In other words, using the free independence of $Y_i$ and $X_j$ we
expand the expectations of polynomial $f''_{X_k}(Z_k)$ as a sum over polynomials in joint moments of variables
$X_j$ and $Y_i$ where $j = 1, \ldots, k$ and $i = k+1, \ldots, N$. By freeness, we can reduce the resulting expression so
that the moments in the reduced expression are either joint moments of variables $X_j$ or joint moments of
variables $Y_i$ but never involve both $X_j$ and $Y_i$. Moreover, we can explicitly calculate the moments of $Y_i$ (i.e.,
expectations of the products of $Y_i$) because they are mutually free. The resulting expansion is (9).

Let us try to make this process clearer by an example. Suppose that $f(x) = x^4$, $N = 4$, $k = 2$ and
$Z_k = Z_2 = X_1 + Y_3 + Y_4$. We aim to compute $Ef''_{X_2}(Z_2)$. Using formula (5), we write:

\[ \begin{aligned} Ef''_{X_2}(Z_2) &= 2E(Z_2^2 X_2^2 + \ldots) \\ &= 2E((X_1 + Y_3 + Y_4)^2 X_2^2 + \ldots) \\ &= 2\{E(X_1^2 X_2^2) + E(X_1 Y_3 X_2^2) + E(X_1 Y_4 X_2^2) \\ &\qquad + E(Y_3 X_1 X_2^2) + E(Y_3^2 X_2^2) + E(Y_3 Y_4 X_2^2) \\ &\qquad + E(Y_4 X_1 X_2^2) + E(Y_4 Y_3 X_2^2) + E(Y_4^2 X_2^2) + \ldots\}. \end{aligned} \]

Then, using the freeness of $Y_3$ and $Y_4$ and the facts that $E(Y_i) = 0$ and $E(Y_i^2) = \sigma_i^2$, we continue as follows:

\[ 2\{E(X_1^2 X_2^2) + \sigma_3^2 E(X_2^2) + \sigma_4^2 E(X_2^2) + \ldots\}, \]

which is the expression we wanted to obtain.

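It is important to note that the coefficients $c_\alpha$ do not depend on variables $X_j$ but only on $Y_j$, $j > k$, and
on the locations which $Y_j$ take in the expansion of $f''_{X_k}(Z_k)$. Therefore, we can substitute $Y_k$ for $X_k$ and
develop a similar formula for $Ef''_{Y_k}(Z_k)$:

\[ Ef''_{Y_k}(Z_k) = \sum_{\alpha} c_\alpha P_\alpha \left( E(Y_k \overline{X}_1 Y_k \overline{X}_2), E(Y_k \overline{X}_3 Y_k \overline{X}_4), \ldots \right). \qquad (10) \]

In the example above, we will have

\[ Ef''_{Y_2}(Z_2) = 2\{E(X_1^2 Y_2^2) + \sigma_3^2 E(Y_2^2) + \sigma_4^2 E(Y_2^2) + \ldots\}. \]

Formula (10) is exactly the same as formula (9) except that all $X_k$ are substituted with $Y_k$. Finally, using
Condition A we obtain that for every $i$:

\[ \begin{aligned} E(Y_k \overline{X}_i Y_k \overline{X}_{i+1}) &= E(Y_k^2) \, E(\overline{X}_i) \, E(\overline{X}_{i+1}) \\ &= E(X_k^2) \, E(\overline{X}_i) \, E(\overline{X}_{i+1}) \\ &= E(X_k \overline{X}_i X_k \overline{X}_{i+1}), \end{aligned} \]

and therefore $Ef''_{Y_k}(Z_k) = Ef''_{X_k}(Z_k)$.
Consequently,

\[ Ef(Z_k + X_k) - Ef(Z_k + Y_k) = \frac{1}{2} \int_0^1 (1 - \tau)^2 Ef'''_{X_k}(Z_k + \tau X_k) \, d\tau - \frac{1}{2} \int_0^1 (1 - \tau)^2 Ef'''_{Y_k}(Z_k + \tau Y_k) \, d\tau. \]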



Next, note that if $f$ is a polynomial, then $Ef'''_{X_k}(Z_k + \tau X_k)$ is the sum of a finite number of terms which
are products of $Z_k + \tau X_k$ and $X_k$. The number of terms in this expansion is bounded by $C_1$, which depends
only on the degree $m$ of the polynomial $f$.
A typical term in the expansion looks like

\[ E (Z_k + \tau X_k)^{m-7} X_k^3 (Z_k + \tau X_k)^3 X_k. \]

In addition, if we expand the powers of $Z_k + \tau X_k$, we will get another expansion that has the number of
terms bounded by $C_2$, where $C_2$ depends only on $m$. A typical element of this new expansion is

\[ E(Z_k^{m-7} X_k^3 Z_k^3 X_k). \]

Every term in this expansion has a total degree of $X_k$ not less than 3, and, correspondingly, a total degree of
$Z_k$ not more than $m - 3$. Our task is to show that as $N \to \infty$, these terms approach $0$.

We will use the following lemma to estimate each of the summands in the expansion of $Ef'''_{X_k}(Z_k + \tau X_k)$.

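Lemma 12 Let $X$ and $Y$ be self-adjoint. Then

\[ |E(X^{m_1} Y^{n_1} \cdots X^{m_r} Y^{n_r})| \leq \left[E(X^{2^r m_1})\right]^{2^{-r}} \left[E(Y^{2^r n_1})\right]^{2^{-r}} \cdots \left[E(X^{2^r m_r})\right]^{2^{-r}} \left[E(Y^{2^r n_r})\right]^{2^{-r}}. \]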

Proof: For $r = 1$, this is the usual Cauchy-Schwarz inequality for traces:

\[ |E(X^{m_1} Y^{n_1})|^2 \leq E(X^{2m_1}) \, E(Y^{2n_1}). \]

See, for example, Proposition 1.9.5 on p. 37 in [9].
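The base case can be spot-checked in a toy model. The sketch below assumes, for illustration only, that $X$ and $Y$ are real symmetric 2x2 matrices and that $E$ is the normalized trace; the matrices and helper names are arbitrary choices, and exact rational arithmetic is used so that the comparison involves no rounding.

```python
from fractions import Fraction

def mul(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def power(a, k):
    """k-th power of a 2x2 matrix (k >= 0)."""
    result = [[1, 0], [0, 1]]
    for _ in range(k):
        result = mul(result, a)
    return result

def expectation(a):
    """Normalized trace tr(A)/2 as an exact fraction."""
    return Fraction(a[0][0] + a[1][1], 2)

X = [[1, 2], [2, -1]]
Y = [[0, 1], [1, 3]]

def cauchy_schwarz_gap(m, n):
    """E(X^{2m}) E(Y^{2n}) - |E(X^m Y^n)|^2, which should be non-negative."""
    lhs = expectation(mul(power(X, m), power(Y, n))) ** 2
    rhs = expectation(power(X, 2 * m)) * expectation(power(Y, 2 * n))
    return rhs - lhs

gaps = {(m, n): cauchy_schwarz_gap(m, n) for m in range(1, 4) for n in range(1, 4)}
print(min(gaps.values()))
```

A non-negative minimum gap is consistent with the inequality; it is of course only a check on one pair of matrices, not a proof.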

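Next, we proceed by induction. We have two slightly different cases to consider. Assume first that $r$ is
even, $r = 2s$. Then, by the Cauchy-Schwarz inequality and the tracial property, we have:

\[ \begin{aligned} & |E(X^{m_1} Y^{n_1} \cdots X^{m_r} Y^{n_r})|^2 \\ & \quad \leq E(X^{m_1} Y^{n_1} \cdots X^{m_s} Y^{n_s} Y^{n_s} X^{m_s} \cdots Y^{n_1} X^{m_1}) \, E(Y^{n_r} X^{m_r} \cdots Y^{n_{s+1}} X^{m_{s+1}} X^{m_{s+1}} Y^{n_{s+1}} \cdots X^{m_r} Y^{n_r}) \\ & \quad = E(X^{2m_1} Y^{n_1} \cdots X^{m_s} Y^{2n_s} X^{m_s} \cdots Y^{n_1}) \, E(Y^{2n_r} X^{m_r} \cdots Y^{n_{s+1}} X^{2m_{s+1}} Y^{n_{s+1}} \cdots X^{m_r}). \end{aligned} \]

Each of the two factors is a product of $r - 1$ alternating pairs of powers. Applying the inductive hypothesis, we obtain:

\[ |E(X^{m_1} Y^{n_1} \cdots X^{m_r} Y^{n_r})|^2 \leq \left[E(X^{2^r m_1})\right]^{2^{-r+1}} \left[E(Y^{2^{r-1} n_1})\right]^{2^{-r+2}} \cdots \left[E(Y^{2^r n_s})\right]^{2^{-r+1}} \cdots \left[E(Y^{2^r n_r})\right]^{2^{-r+1}}. \]

We recall that by the Lyapunov inequality, $E(Y^{2^{r-1} n_1}) \leq \left[E(Y^{2^r n_1})\right]^{1/2}$, and we get the
desired inequality:

\[ |E(X^{m_1} Y^{n_1} \cdots X^{m_r} Y^{n_r})| \leq \left[E(X^{2^r m_1})\right]^{2^{-r}} \left[E(Y^{2^r n_1})\right]^{2^{-r}} \cdots \left[E(X^{2^r m_r})\right]^{2^{-r}} \left[E(Y^{2^r n_r})\right]^{2^{-r}}. \]

Now let $r$ be odd, $r = 2s + 1$. Then

\[ \begin{aligned} & |E(X^{m_1} Y^{n_1} \cdots X^{m_r} Y^{n_r})|^2 \\ & \quad \leq E(X^{m_1} Y^{n_1} \cdots Y^{n_s} X^{m_{s+1}} X^{m_{s+1}} Y^{n_s} \cdots Y^{n_1} X^{m_1}) \, E(Y^{n_r} X^{m_r} \cdots X^{m_{s+2}} Y^{n_{s+1}} Y^{n_{s+1}} X^{m_{s+2}} \cdots X^{m_r} Y^{n_r}) \\ & \quad = E(X^{2m_1} Y^{n_1} \cdots Y^{n_s} X^{2m_{s+1}} Y^{n_s} \cdots Y^{n_1}) \, E(Y^{2n_r} X^{m_r} \cdots X^{m_{s+2}} Y^{2n_{s+1}} X^{m_{s+2}} \cdots X^{m_r}). \end{aligned} \]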
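After that we can use the inductive hypothesis and the Lyapunov inequality and obtain that

\[ |E(X^{m_1} Y^{n_1} \cdots X^{m_r} Y^{n_r})| \leq \left[E(X^{2^r m_1})\right]^{2^{-r}} \left[E(Y^{2^r n_1})\right]^{2^{-r}} \cdots \left[E(X^{2^r m_r})\right]^{2^{-r}} \left[E(Y^{2^r n_r})\right]^{2^{-r}}. \]

QED.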

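We apply Lemma 12 to estimate each of the summands in the expansion of $Ef'''_{X_k}(Z_k + \tau X_k)$. Consider a
summand $E(Z_k^{m_1} X_k^{n_1} \cdots Z_k^{m_r} X_k^{n_r})$. Then by Lemma 12 we have

\[ |E(Z_k^{m_1} X_k^{n_1} \cdots Z_k^{m_r} X_k^{n_r})| \leq \left[E(Z_k^{2^r m_1})\right]^{2^{-r}} \left[E(X_k^{2^r n_1})\right]^{2^{-r}} \cdots \left[E(Z_k^{2^r m_r})\right]^{2^{-r}} \left[E(X_k^{2^r n_r})\right]^{2^{-r}}. \qquad (11) \]

The next step is to estimate the absolute moments of the variable $Z_k$.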



Lemma 13 Let $Z = (v_1 + \ldots + v_N) / N^{1/2}$, where $v_i$ are self-adjoint and satisfy Condition A(1) and let
$E|v_i|^k \leq \mu_k$ for every $i$. Then, for every integer $r \geq 0$,

\[ E(|Z|^r) = O(1) \text{ as } N \to \infty. \]

Proof: We will first treat the case of even $r$. In this case, $E(|Z|^r) = E(Z^r)$. Consider the expansion
of $(v_1 + \ldots + v_N)^r$. Let us refer to the indices $1, \ldots, N$ as colors of the corresponding $v$. If a term in the
expansion includes more than $r/2$ distinct colors, then one of the colors must be used by this term only once.
Therefore, by the first part of Condition A the expectation of such a term is 0.

Let us estimate the number of terms in the expansion that include no more than $r/2$ distinct colors. Consider
a fixed combination of $\leq r/2$ colors. The number of terms that use colors only from this combination
is $\leq (r/2)^r$. Indeed, consider the product
$(v_1 + \ldots + v_N)(v_1 + \ldots + v_N) \cdots (v_1 + \ldots + v_N)$ with $r$ product terms. We can choose an element from the
first product term in $r/2$ possible ways, an element from the second product term in $r/2$ possible ways, etc.
Therefore, the number of all possible choices is $(r/2)^r$. On the other hand, the number of possible different
combinations of $k \leq r/2$ colors is

\[ \frac{N!}{(N - k)! \, k!} \leq N^{r/2}. \]

Therefore, the total number of terms that use no more than $r/2$ colors is bounded from above by

\[ (r/2)^r N^{r/2}. \]
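For small $N$ and $r$ the counting bound can be compared with a brute-force enumeration. The sketch below is illustrative only: colors are encoded as integers $0, \ldots, N-1$, a term of the expansion is a tuple of $r$ colors, and the function names are hypothetical.

```python
from collections import Counter
from itertools import product

def count_few_colors(n_colors, r):
    """Number of terms of (v_1 + ... + v_N)^r using at most r/2 distinct colors."""
    return sum(1 for term in product(range(n_colors), repeat=r)
               if len(set(term)) <= r // 2)

def count_no_singleton(n_colors, r):
    """Number of terms in which no color occurs exactly once.

    Only these terms can have a non-zero expectation under Condition A(1).
    """
    return sum(1 for term in product(range(n_colors), repeat=r)
               if 1 not in Counter(term).values())

def counting_bound(n_colors, r):
    """The upper bound (r/2)^r N^(r/2) from the proof, for even r."""
    return (r // 2) ** r * n_colors ** (r // 2)

for n_colors, r in [(5, 4), (6, 4), (4, 6)]:
    print(n_colors, r,
          count_no_singleton(n_colors, r),
          count_few_colors(n_colors, r),
          counting_bound(n_colors, r))
```

In each row the first count is at most the second, and the second is at most the bound, which is the chain of inequalities the proof relies on.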

Now let us estimate the expectation of an individual term in the expansion. In other words we want to
estimate $E(v_{i_1}^{k_1} \cdots v_{i_s}^{k_s})$, where $k_t \geq 1$, $k_1 + \ldots + k_s = r$, and $i_t \neq i_{t+1}$. First, note that

\[ |E(v_{i_1}^{k_1} \cdots v_{i_s}^{k_s})| \leq E|v_{i_1}^{k_1} \cdots v_{i_s}^{k_s}|. \]

Indeed, using the Cauchy-Schwarz inequality, for any operator $X$ we can write

\[ |E(X)|^2 = |E(U |X|^{1/2} |X|^{1/2})|^2 \leq E(|X|^{1/2} U^* U |X|^{1/2}) \, E(|X|^{1/2} |X|^{1/2}) = E(|X| P) \, E(|X|), \]

where $U$ is a partial isometry and $P = U^* U$ is a projection. Note that from the positivity of the expectation
functional it follows that $E(|X| P) \leq E(|X|)$. Therefore, we can conclude that $|E(X)| \leq E(|X|)$.

Next, we use the Hölder inequality for traces of non-commutative operators (see [4], especially Corollary
4.4(iii) on page 324, for the case of the trace in a von Neumann algebra and Section III.7.2 in [5] for the case
of compact operators and the usual operator trace). Note that

\[ \underbrace{\frac{1}{s} + \ldots + \frac{1}{s}}_{s \text{ times}} = 1, \]

therefore, the Hölder inequality gives

\[ E(|v_{i_1}^{k_1} \cdots v_{i_s}^{k_s}|) \leq \left[ E(|v_{i_1}|^{k_1 s}) \cdots E(|v_{i_s}|^{k_s s}) \right]^{1/s}. \]

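Using this result and the uniform boundedness of the moments (from the assumption of the lemma), we get:

\[ \log |E(v_{i_1}^{k_1} \cdots v_{i_s}^{k_s})| \leq \frac{1}{s} \sum_{i=1}^{s} \log \mu_{k_i s}. \]

Without loss of generality we can assume that bounds $\mu_k$ are increasing in $k$. Using the facts that $s \leq r$ and
$k_i \leq r$, we obtain the bound:

\[ \log |E(v_{i_1}^{k_1} \cdots v_{i_s}^{k_s})| \leq \log \mu_{r^2}, \]

or

\[ |E(v_{i_1}^{k_1} \cdots v_{i_s}^{k_s})| \leq \mu_{r^2}. \]

Therefore,

\[ E(v_1 + \ldots + v_N)^r \leq (r/2)^r \mu_{r^2} N^{r/2}, \]

and

\[ E(Z^r) \leq (r/2)^r \mu_{r^2}. \qquad (12) \]

Now consider the case of odd $r$. In this case, we use the Lyapunov inequality to write:

\[ E|Z|^r \leq \left( E|Z|^{r+1} \right)^{\frac{r}{r+1}} \leq \left[ \left( \frac{r+1}{2} \right)^{r+1} \mu_{(r+1)^2} \right]^{\frac{r}{r+1}}. \qquad (13) \]

The important point is that the bounds in (12) and (13) do not depend on $N$. QED.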

By definition $Z_k = (\xi_1 + \ldots + \xi_{k-1} + \eta_{k+1} + \ldots + \eta_N) / s_N$ and by assumption the moments of $\xi_i$ and $\eta_i$ are uniformly
bounded, and $s_N$ is of order $\sqrt{N}$. Moreover, $\xi_1, \ldots, \xi_{k-1}$ satisfy Condition A by assumption, and $\eta_{k+1}, \ldots, \eta_N$
are free from each other and from $\xi_1, \ldots, \xi_{k-1}$. Therefore, by Proposition 4, $\xi_1, \ldots, \xi_{k-1}, \eta_{k+1}, \ldots, \eta_N$ satisfy
Condition A(1). Consequently, we can apply Lemma 13 to $Z_k$ and conclude that $E|Z_k|^r$ is bounded by
a constant that depends only on $r$ but does not depend on $N$.
Using this fact, we can continue the estimate in (11) and write:

\[ |E(Z_k^{m_1} X_k^{n_1} \cdots Z_k^{m_r} X_k^{n_r})| \leq C_4 \left[E(X_k^{2^r n_1})\right]^{2^{-r}} \cdots \left[E(X_k^{2^r n_r})\right]^{2^{-r}}, \]

where the constant $C_4$ depends only on $m$.
Next we note that

\[ \left[E(X_k^{2^r n_i})\right]^{2^{-r}} \leq C N^{-n_i/2}. \]

Next note that $n_1 + \ldots + n_r \geq 3$; therefore we can write

\[ \left[E(X_k^{2^r n_1})\right]^{2^{-r}} \cdots \left[E(X_k^{2^r n_r})\right]^{2^{-r}} \leq C' N^{-3/2}. \]

In sum, we obtain the following Lemma:

Lemma 14

\[ |Ef'''_{X_k}(Z_k + \tau X_k)| \leq C_5 N^{-3/2}, \]

where $C_5$ depends only on the degree of polynomial $f$ and the sequence of constants $\mu_k$.

A similar result holds for $|Ef'''_{Y_k}(Z_k + \tau Y_k)|$ and we can conclude that

\[ |Ef(Z_k + X_k) - Ef(Z_k + Y_k)| \leq C_6 N^{-3/2}. \qquad (14) \]

After we add these inequalities over all $k = 1, \ldots, N$ we get

\[ |Ef(X_1 + \ldots + X_N) - Ef(Y_1 + \ldots + Y_N)| \leq C_7 N^{-1/2}. \]

Clearly this estimate approaches $0$ as $N$ grows. Applying Proposition 10, we conclude that the measure of
$X_1 + \ldots + X_N$ converges to the measure of $Y_1 + \ldots + Y_N$ in distribution. This finishes the proof of the main
theorem.



5 Concluding Remarks 

The key points of this proof are as follows: 1) We can substitute each random variable $X_i$ in the sum $S_N$
with a free random variable $Y_i$ so that the first and the second derivatives of any polynomial with $S_N$ in the
argument remain unchanged. The possibility of this substitution depends on Condition A being satisfied by
$X_i$. 2) We can estimate a change in the third derivative as we substitute $Y_i$ for $X_i$ by using the first part of
Condition A and several matrix inequalities, valid for any collection of operators. Here Condition A is used
only in the proof that the $k$-th moment of $(\xi_1 + \ldots + \xi_N) / N^{1/2}$ is bounded as $N \to \infty$.

It is interesting to speculate whether the ideas in this proof can be generalized to the case of the multivariate
CLT.